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ABSTRACT 

The  focus  of  tracking  applications  has  traditionally  centred  on  kinematic  state 
estimation.  However,  attribute  information  has  the  potential  to  not  only  pro¬ 
vide  identity  and  class  information,  but  it  may  also  improve  data  associa¬ 
tion  and  kinematic  tracking  performance.  Bayesian  Belief  Networks  provide 
a  framework  for  specifying  the  dependencies  between  kinematic  and  attribute 
states.  Algorithms  based  on  this  framework  are  developed  for  joint  kinematic 
and  attribute  data  association,  kinematic  tracking,  attribute  state  estimation, 
and  joint  kinematic  and  attribute  tracking.  The  algorithms  are  demonstrated 
using  simulated  tracking  scenarios. 
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Kinematic  and  Attribute  Fusion  Using  a  Bayesian  Belief 

Network  Framework 


EXECUTIVE  SUMMARY 

The  kinematic  and  attribute  states  of  a  target  are  not  generally  independent  of  each 
other.  It  is  often  possible  to  infer  information  about  one  from  knowledge  of  the  other.  The 
use  of  both  can  improve  both  data  association  and  state  estimation.  However,  kinematic 
states  are  generally  continuous  and  time  variant,  whereas  attribute  states  are  often  discrete 
and  may  be  static  or  time  variant.  Traditionally,  the  techniques  used  for  the  estimation 
of  one  may  not  be  suitable  for  the  other. 

A  Bayesian  Belief  Network  (BBN)  is  a  useful  tool  for  modelling  the  joint  probability 
of  dependent  continuous  and  discrete  variables.  As  such,  it  can  be  used  to  specify  the 
uncertain  dependencies  between  the  kinematic  and  attribute  states  of  the  target  tracking 
and  recognition  problem,  and  it  provides  a  framework  from  which  data  association  and 
target  state  estimation  algorithms  may  be  developed.  Different  network  topologies  may  be 
used  to  develop  different  algorithms,  that  is,  the  design  of  the  algorithms  will  be  influenced 
by  the  choice  of  network  variables  and  the  relationship  between  them. 

The  BBN  framework  has  been  applied  to  the  development  of  a  joint  kinematic  and 
attribute  data  association  algorithm.  The  use  of  both  kinematic  and  attribute  sensor 
data  and  target  states  improves  the  ability  of  the  algorithm  to  discriminate  between  mul¬ 
tiple  measurements  and  targets.  Kinematic  and  attribute  state  estimation  or  tracking 
algorithms  have  also  been  developed  using  this  framework.  Under  linear  and  Gaussian 
assumptions,  it  is  shown  that  the  kinematic  estimator  reverts  to  the  Kalman  filter.  An 
algorithm  for  estimating  a  target’s  identity  and  class  using  attribute  measurements  is 
also  developed.  The  performance  of  these  algorithms  is  demonstrated  using  a  simulated 
target  tracking  and  recognition  application.  The  improvement  in  the  data  association 
performance  due  to  the  availability  of  the  attribute  measurements  and  target  attribute 
estimates  is  shown  to  improve  the  kinematic  tracking  performance,  particularly  in  dense 
target  scenarios. 

A  joint  kinematic  and  attribute  tracking  algorithm  is  developed  using  a  BBN  that 
captures  the  dependencies  between  the  target’s  kinematic  and  attribute  states.  Demon¬ 
stration  of  the  algorithm  using  simulated  scenarios  shows  that  the  target’s  class  or  type 
can  be  estimated  from  the  kinematic  behaviour  of  the  target.  The  knowledge  of  the  tar¬ 
get’s  class  may  also  improve  the  kinematic  tracking  performance  through  the  selection  of 
the  most  appropriate  kinematic  model  for  that  class. 
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1  Introduction 


The  objective  of  any  effective  surveillance  system  is  to  generate  a  picture  of  the  situa¬ 
tion  of  interest  by  locating  and  recognising  all  the  targets  of  interest  within  its  volume  of 
coverage.  Traditional  techniques  treat  this  as  two  separate  problems  that  are  executed  se¬ 
quentially.  The  first  problem  is  to  estimate  the  position,  speed  and  heading  of  the  targets 
using  the  available  kinematic  measurements.  This  process  is  known  as  target  tracking. 
The  second  problem  is  target  recognition,  in  which  the  attribute  measurements  from  the 
sensors  and  contextual  information  are  used  to  determine  the  most  likely  identity  (ID), 
class,  category,  and  other  information  for  the  targets  being  tracked.  The  identity  of  a 
target  refers  to  its  allegiance,  for  example,  friend,  assumed  friend,  neutral,  suspect  and 
hostile.  A  target’s  class  is  the  type  of  platform,  which  may  be  determined  to  varying 
degrees  of  detail,  and  its  category  is  one  of  air,  surface,  ground,  sub-surface  or  space. 

The  field  of  target  tracking  is  well  established,  finding  its  origins  in  single  radar  track¬ 
ing  [Blackman  &  Popoli  1999].  However,  automatic  target  recognition,  which  includes 
automatic  target  or  track  identification  and  automatic  target  classification,  has  attracted 
increased  interest  since  the  advent  of  multiple  sensor  tracking  and  data  fusion.  The  fusion 
of  data  from  multiple  sensors  provides  disparate  attribute  information  that  may  be  use¬ 
ful  for  discriminating  between  different  classes  and  identities.  For  example,  a  Secondary 
Surveillance  Radar  (SSR)  provides  identification  codes  from  transponder  interrogations 
and  an  Electronic  Surveillance  (ES)  sensor  contributes  to  platform  classification  through 
the  detection  and  identification  of  emitters. 

The  separation  of  kinematic  tracking  and  attribute  state  estimation  assumes  that  the 
two  are  independent.  Although  this  assumption  simplifies  the  problem,  potentially  signif¬ 
icant  information  from  one  of  target  tracking  or  target  recognition  is  not  available  for  the 
other.  In  particular,  the  location,  altitude  and  speed  of  a  target,  and  the  target’s  ability  to 
manoeuvre,  may  provide  insight  into  the  target’s  category,  identity  or  class.  For  example, 
a  Hornet  F/A-18  may  reach  speeds  in  excess  of  Mach  1  and  perform  manoeuvres  with 
acceleration  greater  than  5  g,  whereas  a  Boeing  747  will  not.  Similarly,  knowledge  of  a 
platform’s  class  may  allow  a  tracker  to  select  more  appropriate  dynamic  models  to  improve 
its  kinematic  tracking  performance.  The  use  of  both  kinematic  and  attribute  information 
for  associating  sensor  measurements  or  plots  to  tracks  may  provide  greater  discrimination 
between  data  and  reduce  the  incidence  of  incorrect  association. 

A  target’s  kinematic  state  space  and  kinematic  measurements  are  generally  continu¬ 
ous  valued  entities,  whereas  the  target’s  attributes  and  the  sensor  attribute  measurements 
may  be  either  continuous  or  discrete.  For  example,  Radar  Cross  Section  (RCS)  and  target 
size  may  be  continuous,  but  the  platform  type  and  category  are  discrete.  Furthermore, 
both  the  measurements  and  target  state  estimates  are  uncertain.  A  Bayesian  Belief  Net¬ 
work  (BBN)  has  the  capability  to  accommodate  both  continuous  variables  to  represent  a 
target’s  kinematic  information  and  discrete  variables  to  represent  its  identity  and  class  in¬ 
formation,  making  it  a  suitable  candidate  for  target  tracking  and  recognition  applications. 
The  underlying  Bayesian  probability  calculus  provides  the  mechanism  for  handling  state 
uncertainty  and  errors.  Thus,  BBNs  provide  a  rigorous  basis,  or  framework,  for  developing 
(joint)  tracking  and  recognition  algorithms. 
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This  report  addresses  the  development  of  kinematic  and  attribute  data  association 
and  state  estimation  algorithms,  using  the  BBN  as  the  basis  for  defining  the  state  and 
measurement  variables  and  the  dependencies  between  them.  The  BBN  is  not  directly  im¬ 
plemented  as  the  solution,  rather  it  may  be  thought  of  as  a  tool  used  for  the  development  of 
the  algorithms,  with  the  network  topology  defining  the  posterior  probability  of  the  output 
state  variables.  This,  and  the  method  used  to  determine  the  most  probable  values  for  the 
output  state  variables,  dictate  the  design  of  the  resulting  algorithm.  This  approach  is  used 
to  develop  data  association  algorithms  using  both  kinematic  and  attribute  measurements 
and  state  variables,  a  kinematic  state  estimation  algorithm  using  kinematic  measurements, 
and  an  attribute  state  estimation  algorithm  using  attribute  measurements.  The  latter  is 
demonstrated  using  a  target  identification  and  classification  application.  Finally,  a  joint 
target  tracking  and  classification  algorithm,  in  which  both  the  kinematic  and  attribute 
measurements  are  used  to  jointly  estimate  the  target  dynamics  and  class,  is  developed. 
Although  the  algorithms  are  developed  for  target  tracking  and  recognition  applications, 
the  BBN  formulation  allows  the  dependencies  between  any  state  and  measurement  vari¬ 
ables  to  be  defined,  and  provides  a  mechanism  for  developing  similar  algorithms  for  other 
scenarios  and  applications. 

Following  this  introduction,  the  relevant  background  material  is  presented  in  Section  2, 
and  a  brief  description  of  BBNs  follows  in  Section  3.  The  data  association  algorithms  are 
developed  and  evaluated  in  Section  4.  Similarly,  the  state  estimation  algorithms  and  the 
joint  tracking  and  recognition  algorithms  are  presented  in  Sections  5  and  6,  respectively. 
Finally,  the  conclusions  are  presented  in  Section  7. 


2  Background 

In  general,  a  typical  sensor  will  provide  at  least  partial  kinematic  information.  It  may 
also  provide  attribute  data  that  supports  the  estimation  of  non-kinematic  states,  such  as 
identity,  class,  and  category.  For  example,  primary  radars  may  provide  range,  bearing,  ele¬ 
vation  and  Doppler  velocity  measurements  and  may  also  provide  classification  information 
through  Non-Cooperative  Target  Recognition  (NCTR)  techniques  that  use  the  propeller 
or  jet  engine  modulation  of  the  Doppler  frequency.  Specialist  radar  modes,  such  as  High 
Range  Resolution  (HRR)  and  Inverse  Synthetic  Aperture  Radar  (ISAR)  provide  additional 
classification  capabilities.  Secondary  radars  are  used  extensively  for  Air  Traffic  Control 
(ATC)  in  the  civilian  domain  and  Identification  Friend  or  Foe  (IFF)  for  target  identifi¬ 
cation  in  military  applications.  These  sensors  provide  bearing  and  range  measurements, 
and  identification  and  altitude  information  through  Selective  Identification  Feature  (SIF) 
codes.  Electronic  Surveillance  (ES)  sensors,  such  as  Electronic  Support  Measures  (ESM) 
and  Radar  Warning  Receivers  (RWRs)  may  provide  bearing  and  elevation  measurements, 
and  emitter  and  platform  classification  information.  An  infrared  (IR)  sensor,  such  as  a 
Forward  Looking  Infrared  (FLIR)  or  a  Infrared  Search  and  Track  (IRST),  provides  accu¬ 
rate  bearing,  elevation  and,  if  fitted  with  a  laser  range  finder,  range  measurements.  The 
availability  of  target  size  information  may  contribute  to  target  recognition.  Tactical  Dig¬ 
ital  Information  Link  (TADIL),  Satellite  Communication  (SATCOM)  protocols,  such  as 
the  Integrated  Broadcast  System  (IBS),  and  other  data  communication  links  are  potential 
sources  of  attribute  information.  In  addition,  prior  or  contextual  information,  such  as  the 
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location  of  air-bases,  flight  paths  and  air  corridors,  may  also  contribute  to  the  estimation 
of  attribute  states. 

Although  both  kinematic  and  attribute  target  state  information  provide  useful  in¬ 
formation  for  compiling  the  situational  picture,  a  number  of  significant  differences  exist 
between  the  two.  A  target’s  kinematic  state,  which  is  primarily  its  position  and  velocity, 
is  generally  continuous  and  time  variant.  On  the  other  hand,  the  attribute  states  may  be 
either  continuous  or  discrete,  and  either  time  varying  or  time  invariant.  For  example,  RCS 
is  continuous  and  time  variant,  whereas  identity  is  discrete  and  time  invariant.  The  target 
attributes  may  be  categorised  as  either  evolutionary  or  non- evolutionary,  for  time  varying 
or  time  invariant,  respectively  [Blackman  Sz  Popoli  1999].  The  sensor  measurements  may 
be  similarly  categorised. 

The  two  key  challenges  are  to  successfully  combine  the  disparate  data  or  information, 
and  to  handle  the  errors  or  uncertainty  in  the  data.  Bayesian  techniques  and  Dempster- 
Shafer  theory  are  two  prevailing  approaches  that  are  able  to  accommodate  these  challenges. 
Bayesian  techniques  use  Bayesian  probability  theory  to  handle  the  uncertainty,  whereas 
Dempster-Shafer  introduces  the  concepts  of  support  and  plausibility  [Bogler  1987,  Simard, 
Couture  &  Bosse  1996].  Bayesian  probability  theory  provides  a  common  frame  of  reference 
for  the  kinematic  and  attribute  variables,  and  has  previously  been  exploited  for  joint 
tracking  and  classification  [Challa  &  Pulford  1999,  Gordon,  Masked  &  Kirubarajan  2002, 
Krieg  2003  a]. 

Bayesian  techniques  involve  determining  the  posterior  probability  of  the  variables  of 
interest,  and  then  applying  estimation  techniques,  such  as  maximum  a  posteriori  (MAP), 
to  determine  the  most  likely  state  from  the  posterior  distribution.  The  difficulty  is  defin¬ 
ing  the  posterior  probability,  which  may  involve  the  joint  probability  of  all  the  variables 
in  the  system.  Bayesian  Belief  Networks  (BBNs)  are  causal  belief  networks  that  model 
the  dependencies  between  the  variables,  represented  by  network  nodes,  using  Bayesian 
probability  [Pearl  1988],  and  only  require  the  user  to  know  the  dependence  relationships 
between  directly  connected  nodes  or  variables.  In  addition,  the  ability  to  accommodate 
continuous  variables  to  represent  kinematic  information  and  discrete  variables  to  represent 
identity  and  class  information  make  them  a  suitable  candidate  for  multiple  source  target 
tracking  and  recognition  applications  [Stewart  &  McCarty  1992],  However,  the  depen¬ 
dencies  between  attributes  generally  produce  non-singly  connected  networks  for  attributes 
that  evolve  over  time  [Hautaniemi,  Korpisaari  Sz  Saarinen  2000],  which  generally  require 
the  use  of  techniques  such  as  clustering,  conditioning  or  stochastic  simulation  to  guarantee 
convergence  to  the  global  solution  [Pearl  1988]. 

BBNs  have  been  applied  to  a  number  of  multiple  source  tracking  related  applications. 
For  example,  Korpisaari  Sz  Saarinen  [1999]  proposed  a  generalised  Bayesian  network  struc¬ 
ture  with  a  many-dimensional  association  vector  for  applying  attribute  data  in  a  Joint 
Probabilistic  Data  Association  (JPDA)  context,  and  Chang  Sz  Fung  [1997]  used  a  BBN  to 
incorporate  attribute  information  into  a  multiple  hypothesis  tracking  algorithm  in  which 
attributes  are  used  to  solve  both  the  association  and  track  update  problems.  Krieg  showed 
how  BBNs  may  be  used  as  a  basis  for  deriving  the  Kalman  filter  tracking  algorithm,  and 
used  BBNs  to  develop  an  attribute  estimation  algorithm  [Krieg  2002],  various  joint  kine¬ 
matic  and  attribute  data  association  algorithms  [Krieg  20036],  and  a  joint  kinematic  and 
attribute  tracking  solution  [Krieg  2003  a]. 
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3  Bayesian  Belief  Networks 

A  Bayesian  belief  network  (BBN)  is  a  causal  network  comprising  a  set  of  nodes  con¬ 
nected  by  directed  links  or  arcs.  The  nodes  of  the  network  represent  the  stochastic  variables 
of  a  system  or  domain,  and  the  directed  arcs  between  the  nodes  represent  the  direct  causal 
influences  between  the  variables  [Jensen  1996].  Connected  nodes  are  deemed  to  have  a 
cause  and  effect,  or  parent  and  child,  relationship,  with  the  direction  of  the  arc  being  from 
cause  to  effect,  or  parent  node  to  child  node.  Any  node  without  at  least  one  parent  node 
is  known  as  a  root  node,  and  any  node  with  no  child  nodes  is  referred  to  as  a  leaf  node. 

The  BBN  provides  a  complete  probabilistic  model  of  all  the  variables  in  the  domain  of 
interest.  Although  it  may  be  difficult  to  conceptualise  the  overall  interaction  between  all 
the  variables,  the  construction  of  the  BBN  is  based  on  the  local  interactions  between  the 
variables  that  directly  influence  one  another.  Therefore  it  is  only  necessary  to  determine 
the  conditional  probabilities  for  each  variable,  that  is,  the  probability  given  the  state  of 
each  of  its  direct  causes  or  parent  nodes  [Pearl  1988].  This  is  conceptually  simpler  than 
attempting  to  define  the  joint  probability  of  all  the  variables  in  the  domain. 

The  independence  between  the  variables  of  a  BBN  is  introduced  through  the  concept  of 
d-separation  [Jensen  1996].  Within  a  BBN,  some  knowledge  of  the  state  of  a  variable  may 
be  inferred  from  the  state  of  another  variable.  However,  if  the  states  of  all  the  variables 
in  all  the  paths  separating  the  two  variables  are  known,  then  the  knowledge  of  the  state 
of  one  will  not  provide  any  additional  insight  into  the  state  of  the  other,  and  the  two 
variables  become  independent.  Similarly,  if  the  state  of  a  particular  node  is  known,  then 
its  child  nodes  are  independent.  The  situation  differs  for  the  variables  sharing  a  common 
child  node;  the  parent  nodes  are  only  independent  if  there  is  no  knowledge  of  the  state 
of  the  child  or  any  of  its  descendants.  Another  distinction  is  that  only  partial,  or  soft, 
knowledge  is  necessary  for  dependence  in  this  case,  as  opposed  to  the  two  former,  where 
complete,  or  hard,  knowledge  is  required  for  independence. 

The  objective  is  to  determine  the  posterior  probability,  or  belief,  of  the  state  of  variables 
of  interest  in  the  network,  subject  to  the  application  of  the  available  knowledge  of  the  state 
of  a  subset  of  the  nodes  of  the  network;  herein  referred  to  as  evidence.  This  posterior 
probability  is  dynamically  calculated  from  the  static  prior  and  conditional  probabilities, 
and  any  evidence  that  has  been  applied  to,  and  propagated  through,  the  network.  By 
definition,  the  evidence  is  propagated  in  the  direction  of  the  causal  links  using  ir  messages 
and  against  the  direction  of  the  links  using  A  messages.  On  receipt  of  a  message  from 
one  of  the  nodes  directly  connected  to  it,  a  node  will  update  its  belief  and  propagate  new 
messages  to  each  of  the  other  nodes  directly  connected  to  it,  thereby  ensuring  the  evidence 
is  propagated  throughout  the  network.  The  belief  at  any  node  is  the  normalised  product 
of  the  A  and  n  information  at  that  node. 

In  this  report  it  is  assumed,  without  loss  of  generality,  that  evidence  is  only  applied  to 
the  network  as  leaf  nodes.  To  avoid  convergence  and  deadlock  issues  related  to  loops  in  the 
underlying  Markov  network,  propagation  is  only  considered  for  tree  structured  networks, 
that  is,  those  for  which  each  variable  has  only  one  parent  node.  Non-complying  networks 
are  converted  to  tree  structures  by  clustering  or  aggregating  nodes  [Pearl  1988,  Krieg 
2001].  Unless  stated  otherwise,  a  node  or  variable  is  denoted  by  an  uppercase  alphabetic 
character,  and  its  value  is  denoted  by  its  lowercase  equivalent. 
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7r( VN ) 


Figure  1:  Evidence  propagation  in  a  BBN 


3.1  Propagation  in  Bayesian  Belief  Networks 

To  illustrate  the  propagation  of  evidence  in  a  BBN,  consider  the  network  of  Figure  1. 
The  total  evidence  or  information,  ex,  influencing  the  proposition  that  the  node  X  has 
the  value  x,  that  is,  X  =  x,  is  separated  into  two  disjoint  subsets,  e\  denoting  the 
evidence  introduced  through  the  arc  between  the  node  X  and  its  parent  node,  W,  and 
e^,  denoting  the  evidence  introduced  to  the  node  X  through  the  arcs  between  it  and 
its  children,  Yj,...,Y/v.  The  specific  evidence  introduced  through  the  arc  to  child  Yj, 
z  =  1,2,  ...,7V,  is  denoted  e^y. . 

For  the  root  node,  W,  the  evidence  is  the  background  knowledge  or  prior  proba¬ 
bility  of  all  the  possible  values  for  that  node,  p  {W  =  w) ,  Vrc,  which  is  denoted  p  (w)  for 
notational  expediency.  In  the  case  of  the  leaf  nodes,  Y,  i  =  1, . . . ,  N,  ey  may  take  one  of 
three  forms,  namely: 

1.  no  evidence,  where  the  possible  values  of  the  node  Y  are  uniformly  distributed, 
that  is,  p  (yi)  =  jj—,  where  Nyi  is  the  number  of  possible  values  of  Y;  for  discrete 
variables  or  the  difference  between  the  maximum  and  minimum  values  for  continuous 
variables; 

2.  soft  evidence,  where  the  probability  of  each  possible  value  of  Yj  is  provided,  generally 
as  a  probability  mass  function  for  discrete  variables  or  a  probability  density  function 
for  continuous  variables;  or 
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3.  hard  evidence,  where  the  value  of  the  node  Y%  is  known,  that  is,  p  (jji)  =  Sy.yt,  where 
y\  is  the  known  value  of  Yl  and  5y.yi  is  the  Kronecker  Delta  function,  which  equals  1 
when  r/i  =  y'i  and  zero  otherwise. 


Assuming  that  ex  and  ex  are  independent,  which  is  supported  by  the  concept  of 
d-separation  [Jensen  1996],  and  applying  Bayes’  Rule,  the  belief  that  X  =  x  is 


Bel(x)  =  p  (x\ex,e^) 


p  (ex\x)  p  (x\e+) 


where  is  the  summation  over  all  the  possible  values  of  the  node  X.  Defining  ir(x)  = 
p  (x\ex)  and  X(x)  =  p  (e^|x),  this  becomes 


Bel(x)  =  Q!7r(x)A(x), 


(1) 


where  A(x)  is  the  likelihood  representing  the  diagnostic  or  retrospective  support  for  the 
proposition  X  =  x,  tt(x)  is  the  prior  probability  representing  the  causal  or  predictive 
support  for  X  =  x,  and  a  is  a  normalising  constant. 

Consider  the  propagation  of  the  A  messages  in  Figure  1.  The  retrospective  support 
provided  by  the  evidence  on  each  leaf  node  Yi,  i  =  1, ...  ,1V  is  contained  in  the  message 
A Yi(x),  which  is  defined  as 

xYi(x)  =  ^2p{yi \x)X(yi),  (2) 

Vi 

where  p  (yi\x)  is  the  probability  that  Y%  =  y±  conditioned  on  X  =  x.  The  total  retrospective 
support  for  the  proposition  X  =  x  is  obtained  by  aggregating  the  retrospective  support 
from  all  the  child  nodes  of  X  using 

N 

Kx)  =  nAy;(x)-  (3) 

i— 1 

The  retrospective  support  is  then  propagated  to  the  parent  of  X  according  to 

Kw)  =  A(x).  (4) 


The  predictive  support  is  propagated  by  the  n  messages  in  the  direction  of  the  links. 
The  support  provided  by  the  root  node  W  is  simply  the  prior  probability  of  W,  that  is, 
7 t(w)  =  p  (w).  This  is  propagated  to  the  node  X  using 

7r(a:)  =  p  (x|u;)  7r(u;).  (5) 

W 


From  X,  the  predictive  support  is  propagated  to  the  child  nodes  of  X ,  namely  Yi,  i  = 
1, . . . ,  N,  as 


N 

*Yi(x)=  Ayfc(x)vr(x) 

k=l\i 

where  \  denotes  the  setminus  operator,  that  is,  J 
k  =  1, . . . ,  i  —  1,  i  +  1, . . . ,  IV. 


Bel(x) 

XYi(x) 1 


(6) 


k=i\i  is  the  product  over  the  values 
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Having  now  propagated  both  the  predictive  and  retrospective  support  throughout  the 
BBN,  the  posterior  probability  of  each  possible  value  for  each  node  may  be  calculated 
as  the  normalised  product  of  the  predictive  and  retrospective  support  of  that  node.  The 
process  is  repeated  whenever  further  evidence  is  applied  to  the  network. 


4  Data  Association 

Traditional  data  association  using  kinematic  data  provides  limited  discrimination  in 
the  measurement  space,  which  is  usually  of  a  lower  dimension  than  the  track  state  space. 
Although  sufficient  for  sparse  target  scenarios,  dense  target  scenarios  may  produce  am¬ 
biguous  solutions  that  have  the  potential  to  lead  to  poor  association  performance.  The 
use  of  attribute  data  has  the  potential  to  increase  the  discrimination  in  the  measurement 
space.  For  example,  an  ESM  system  may  provide  bearing  measurements  and  emitter  iden¬ 
tification  information.  The  bearing  alone  often  provides  inadequate  discrimination,  and 
the  additional  discrimination  provided  by  the  emitter  identification  has  the  potential  to 
resolve  some,  if  not  all,  the  ambiguities.  However,  a  consistent  distance  measure  must  be 
applied  to  all  elements  used  in  the  association.  The  traditional  squared  distance  metric 
used  for  kinematic  data  does  not  readily  apply  to  attribute  data.  These  distance  metrics 
are  a  simplification  of  a  probability  based  on  Gaussian  distributions,  where  the  squared 
distance  appears  in  the  exponent.  In  the  case  where  all  stochastic  variables  are  Gaussian 
distributed,  the  smallest  distance  corresponds  to  the  maximum  probability. 

This  section  considers  two  approaches  to  data  association,  namely  unconstrained  and 
constrained.  The  former  treats  all  measurements  at  a  particular  time,  or  within  a  single 
scan,  independently.  As  such,  the  association  is  unconstrained,  and  is  referred  to  here  as 
unconstrained  measurement-to-track  association.  The  latter  treats  all  the  measurements 
at  a  particular  time,  or  from  a  single  scan,  as  a  block  of  measurements.  This  relaxation 
of  the  assumption  of  independence  between  the  measurements  allows  the  introduction  of 
constraints  on  the  association  solution,  such  as  permitting  only  one  measurement  from 
each  sensor  at  a  particular  time  or  scan  to  be  associated  with  any  one  track.  This  is 
referred  to  here  as  constrained  measurement-to-track  association. 


4.1  Unconstrained  Measurement-to- Track  Association 

In  unconstrained  data  association,  each  measurement  is  treated  in  isolation,  and  the 
problem  becomes  one  of  associating  a  single  measurement  with  any  one  of  M  tracks  at 
time  t.  This  is  modelled  by  the  BBN  of  Figure  2,  where  the  measurement  is  represented 
by  the  node  Zt,  and  the  states  of  the  M  tracks  are  represented  by  each  of  the  nodes 
X^\ . . . , x[M\  The  association  node  Kt  represents  a  discrete  variable  whose  value  is  the 
number  of  the  track  to  which  the  measurement  is  associated,  that  is,  a  track  index  for 
that  measurement.  The  association  problem  is  now  simply  one  of  estimating  the  state  of 
Kt .  This  involves  determining  the  posterior  probabilities  for  each  possible  state  of  Kt  and 
using  the  maximum  a  posteriori  approach  of  selecting  the  state  with  the  greatest  proba¬ 
bility.  This  is  expressed  mathematically  as  argmax  p  (jct\zt,  %t  \  ■  ■  ■ ,  .  Although  the 

formulation  in  Figure  2  shows  a  single  measurement  at  time  t,  this  does  not  restrict  the 
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Figure  2:  BBN  representation  of  the  data  association  problem 


Figure  3:  Clustered  BBN  representation  of  the  data  association  problem 


problem  to  a  single  measurement  at  any  specific  time;  simultaneous  measurements  may 
be  processed  sequentially. 

The  BBN  of  Figure  2  is  not  a  tree  structure  by  virtue  of  the  multiple  parents  of  the 
node  Zt.  Clustering  [Pearl  1988,  Krieg  2001]  is  used  to  obtain  the  structure  of  Figure  3, 
in  which  the  probability  of  the  new  composite  variable  At  represents  the  joint  probability 
of  the  variables  X^\  . . . ,  X and  Kt. 

Assuming  that  the  prior  probabilities  of  the  track  states  and  Kt  are  independent,  which 
is  reasonable  if  no  measurement  is  associated  with  more  than  one  track,  the  posterior 
probability  that  At  will  be  in  a  particular  state  is  the  belief  of  that  state,  namely 

p  (at\zt)  =  Bel(at)  =  aX(at)n(at),  (7) 

where  it  (at)  is  the  prior  probability 

M 

p M  =  p (41}> •  ■  =  p (h)  n  p (x!m))  • 

m= 1 


(8) 


DSTO-RR-0315 


The  track  states  are  simply  the  state  of  the  most  recent  update  extrapolated  to  the  time 
t,  that  is,  to  the  time  of  the  current  measurement.  The  extrapolated  or  predicted  track 
state  probabilities  may  be  more  accurately  denoted  p  (x^ ,  where  the  posterior 

probability’s  dependence  on  the  set  of  measurements  up  to  the  time  t  —  1,  that  is,  Z*  , 
is  explicit  (or,  more  accurately,  the  subset  of  measurements  from  Zt~ 1  that  are  associated 
with  the  m ^  track). 

Assume  that  A (zt)  =  5ZtZ't  and  that  the  measurement  is  produced  from  one  or  more 
targets  represented  by  a  single  track.  Then 

p(zt\at)  =  p(zt\x<l1\...,x[M\kt^  =p(zt|4fct))  (9) 

and,  using  (4),  the  retrospective  support  becomes 

M°t)  =^2?  (zt\at)  Hzt)  =  P  (4\4kt))  •  (10) 

Z 

Substituting  (8)  and  (10)  into  (7),  the  belief  that  At  =  at  may  be  written 

M 

Bel(ot)  =  ap  (z't\x[kt)^  p  (kt)  JJ  p  .  (11) 

m=  1 


The  belief  of  the  node  Kt  may  now  be  obtained  by  integrating  the  target  states  out  of  the 
composite  variable  At,  that  is, 


Bel(fct)  =  ap  (kt)  j  p  (z[ \x[kt)^j  p  (x[kt)\Zt  ^  dxft] 


-  n 

m=l\kt  (m) 


p(x!m)|Z^)  dx [m\ 

=  ap  (kt)  J  P  (. z't\xtkt) )  P  (*ifct)| zi”1)  dx[kt), 


(12) 


where 


-l 


a  = 


£p(*7)  I  p  (^K(fct))  p  x)  dxlkt) 

kt  L, 


-(**) 
C£ 


Assuming  uniform  prior  probabilities  for  Kt,  (13)  may  be  written 


Bel(fct)  = 


Ixvh)  p  (zt\xtkt))  P 


{kt)\zt-l\  dJh) 


Em=i  fxM  P  (z't\xim))  p  (xjm)| Zt  dx[m) 


(13) 


(14) 


This  result  is  consistent  with  the  concept  of  d-separation,  where  the  knowledge  of  the  node 
Zt  allows  knowledge  of  the  state  of  the  node  Kt  to  be  inferred  from  the  knowledge  of  the 
track  states. 
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For  the  special  case  of  continuous  linear  Gaussian  systems,  (14)  becomes 


det  ( D(kt ))  2  exp  (z[  -  Hx[ktA  D(kt )  1  (z't  - 
Bel(A*)  = - V  x - - -  ry - ; - (15) 


Em=l  det  ( D(™ ))  *  eXP  (“5  (Zt  ~  Hxtm))  D(m) 


~llz't-  Hxt 


(m) 


(£)  .  .  .  U) 

;  is  the  error  covariance  of  the  track  state  estimate  x\  ,  H 


where  D{i)  =  HPt{i)HT  +  R,  Pt( 
is  the  measurement  matrix  that  maps  the  measurement  space  onto  the  track  state  space, 
and  R  is  the  error  covariance  of  the  measurement  Zt-  This  result  is  similar  to  the  nearest 
neighbours  approach  for  kinematic  association,  as  maximising  the  probability  is  equivalent 
minimising  the  argument  of  the  exponential. 

If  the  state  and  measurements  can  be  separated  into  independent  kinematic  and  at¬ 
tribute  components,  for  example,  states  x\kt>  and  w[kl>  with  measurements  zt  and  yt , 
respectively,  (14)  may  be  written  as  the  normalised  product  of  the  kinematic  and  at¬ 
tribute  posterior  probabilities.  Assuming  continuous  kinematic  states  and  discrete  at¬ 
tribute  states,  this  gives 


Bel(fct)  =  7t  f  P  (z[ \x[kt)^  p  Z*  dxft]  ; 

(hr l 


w 


(kt) 


P  I  Wt 


(kt.)  |^t— 1 


(16) 


_(*=*) 

Lt 


where 


7 1  = 


M 

E 

m—  1 


.(™) 

t 


Sm) 


P  \Zt\xi  P  Fi 


„(m)i  zt~1 


dx 


(m) 


E: 


W. 


(m) 


P  \wi 


(m) 


Y 


t- 1 


This  allows  the  attribute  and  kinematic  information  to  be  treated  separately  before  com¬ 
bining  into  a  single  probabilistic  metric. 


4.2  Constrained  Measurement-to- Track  Association 

The  single  measurement  problem  is  now  expanded  to  one  of  associating  N  mea¬ 
surements  at  some  time  t  to  M  tracks.  This  is  represented  by  the  BBN  of  Figure  4, 

(n) 

where  the  N  measurements  are  denoted  as  z±  ,  n  =  1,2,  ...,1V,  and  the  track  states 

as  m  =  1,2 ,...,M.  The  association  variable,  Kt,  is  now  the  N  x  1  vector,  kt  = 

(fcd\  kf2\  . . . ,  *f°)T,  kf'1'1  £  {0, 1,  2, ... ,  M},  where  the  value  of  the  element  is  the 

th 

number  of  the  track  to  which  the  nUi  measurement  is  associated.  A  value  of  0  indicates 
that  the  measurement  is  not  associated  to  any  track.  This  may  occur  if  one  or  more  of 
the  following  conditions  apply. 

1.  The  number  of  measurements  exceeds  the  number  of  tracks  ( N  >  M )  and  the 
constraint  that  each  target  may  only  produce  one  measurement  is  enforced. 

2.  A  false  alarm  may  occur  with  probability  greater  than  zero. 
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Figure  Jh:  BBN  for  associating  measurements  to  tracks 


3.  A  track  initiation  event  may  occur  with  probability  greater  than  zero. 


As  in  the  former  case,  the  solution  is  obtained  by  estimating  the  value  of  kt. 

The  BBN  of  Figure  4  contains  loops  in  the  underlying  Markov  network  that  have 
the  potential  to  introduce  deadlock  conditions  that  prevent  convergence  to  the  optimal 
solution.  The  clustering  technique  [Pearl  1988]  is  again  used  to  overcome  this  prob¬ 
lem,  producing  the  tree  structured  BBN  of  Figure  5.  As  for  the  unconstrained  case, 
the  clustered  variable  comprises  the  target  states  and  the  association  variable,  that  is, 
At  =  {x[l\  X^\  . . . ,  x[M\  Kt} ■  The  predictive  support  for  At  =  at  is  simply  its  prior 
probability,  namely 


M 

vr(at)  =  p  (at)  =  p  (kt)  P  l^_1)  •  (17) 

m—  1 


As  before,  it  is  assumed  that  the  prior  probabilities  of  the  track  states  and  the  association 
variable  are  independent,  which  is  valid  under  the  assumption  that  a  measurement  can 
only  be  associated  with  a  single  track.  This  should  be  reflected  in  the  prior  probability 
of  kt,  which  should  be  zero  for  any  values  containing  multiple  instances  of  the  same  track 
number,  with  the  exception  of  the  value  0,  for  which  multiple  occurrences  are  allowed. 

To  obtain  the  solution,  denote  the  known  value  of  as  z^',  the  retrospective 
support  as 

A(4n))  =  ,5  („)  (n),,  (18) 

Zt  Zt 

and  the  conditional  probabilities  as 


P 


n  =  1,  2, . . . ,  N. 


(19) 


11 


DSTO-RR-0315 


Figure  5:  Clustered  BBN  for  associating  N  measurements  to  M  tracks 


Substituting  this  and  (18)  into  (2),  the  retrospective  support  for  At  =  at  from  the  ra^ 
measurement  becomes 


Az(n)(at)  =  P  I  \xf  I  , 


(20) 


where  p  (z!^ ' may  be  viewed  as  the  probability  of  false  alarm  or  new  track.  From 
(3),  the  support  for  At  =  at  from  all  measurements  is 


N 


Kat) = n p 


(n)f\ 

A  \xt 


Sn) 


n=l 


Invoking  (1)  with  (17)  and  (21)  gives  the  belief  of  At  =  at  as 


(21) 


M 


m=  1 


Bel  (a*)  =  ap(fct)  p  (x<[m'>\Zt  ')  p 


N 


(n)  /i 

A  Ft 


n= 1 


(22) 


The  belief  of  Kt  =  kt  may  now  be  found  by  marginalising  At ,  that  is, 


Bel(fct)  = 


„  .  M  N 

ap  (h)  I  ■  I  n  p  (^m)i^i_i)  n p 

m=l  n= 1 


(n)/i 

A  Ft 


dxA1^  ■  ■  ■  dxA'1 


T(l)  r(M) 


mGfct  (m) 
m^O  x\ 


ap (h) n  / p m  n  p 


I  (n) _ 


4n)/  |xSm))  d4m) 


n  /  P  (z'-v-1)  ■te,K)  n 


pi4”'>i40) 


(23) 


(m/} 

xt 


n':k[n')=  0 
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Given  the  constraint  that  each  target  can  only  produce  one  measurement,  this  reduces  to 


where  a  is  a  normalising  constant  given  by 


Using  the  most  probable  value  as  the  best  estimate  for  the  association  vector,  a  may  be 
ignored,  and  the  solution  is  simply 


Again,  this  may  be  split  into  kinematic  and  attribute  components. 

The  key  assumption  of  a  maximum  of  one  measurement  associated  to  each  track  un¬ 
derpins  this  treatment.  Therefore,  the  prior  probability  mass  function  of  the  association 
vector  must  define  zero  probability  for  all  values  that  violate  this  assumption.  Under  linear 
and  Gaussian  assumptions,  the  solution  is  effectively  a  Global  Nearest  Neighbours  (GNN) 
association  algorithm  for  kinematic  and  attribute  data. 


4.3  Simulation  Results 

The  utility  of  joint  kinematic  and  attribute  association  is  demonstrated  in  the  following 
simulated  scenario.  The  scenario  comprises  two  crossing  straight  line  target  trajectories. 
Range,  azimuth  and  identity  measurements  are  provided  by  a  Secondary  Surveillance 
Radar  (SSR),  and  an  Electronic  Surveillance  (ES)  sensor  provides  azimuth  and  platform 
class  or  type  measurements.  Although  the  primary  focus  of  the  analysis  of  this  section  is 
the  performance  of  the  data  association  algorithms,  state  estimates  are  required  by  the 
data  association  algorithms  at  each  time  step.  For  convenience,  the  kinematic  tracking  is 
performed  using  an  Extended  Kalman  Filter  (EKF),  which  provides  a  linear  approximation 
to  the  non-linear  measurement  function,  and  the  identity  and  class  of  the  targets  is  updated 
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Table  1:  Data  association  accuracies 
(a)  Kinematic  only 


Sensor 

Track 

%  Correct 

%  Incorrect 

%  Missed 

SSR 

1 

86 

13 

1 

2 

85 

13 

2 

ES 

1 

58 

42 

0 

2 

57 

41 

2 

(b)  Joint  kinematic  and  attribute 


Sensor 

Track 

%  Correct 

%  Incorrect 

%  Missed 

SSR 

1 

99 

0-5 

0-5 

2 

98 

0-5 

1-5 

ES 

1 

71 

28 

1 

2 

71 

27 

2 

using  the  algorithm  presented  by  Krieg  [2002].  The  performance  of  the  state  estimation 
algorithms  is  deferred  to  Section  5.3. 

The  measurement-to-target  association  is  performed  in  two  steps.  In  the  first,  proba¬ 
bilistic  gating  is  used  to  eliminate  the  associations  with  a  low  probability  of  being  correct. 
The  constrained  association  algorithm,  as  described  in  Section  4.2,  performs  the  second 
association  step,  that  is,  determining  which  of  the  remaining  measurements  are  associated 
with  each  track.  A  Monte  Carlo  analysis  is  presented  for  two  cases,  namely  association 
using  only  kinematic  measurements,  and  association  using  both  kinematic  and  attribute 
measurements. 

For  the  case  of  kinematic  only  association,  a  significant  number  of  the  Monte  Carlo 
runs  result  in  the  tracks  swapping  targets  near  the  intersection  of  the  two  target  trajec¬ 
tories.  Such  a  run  is  presented  in  Figure  6(a),  where  the  £+’  and  ‘o’  represent  the  SSR 
measurements  from  different  targets — the  ES  measurements  are  not  presented  in  the  in¬ 
terest  of  picture  clarity.  On  the  other  hand,  the  joint  kinematic  and  attribute  association 
algorithm  successfully  traverses  both  targets  in  all  the  runs,  as  illustrated  in  the  single  run 
of  Figure  6(b). 

The  percentages  of  correct,  incorrect  and  missed  associations  for  both  tracks  under 
each  algorithm  are  presented  in  Tables  1(a)  and  1(b).  The  missed  associations  are  a  result 
of  the  probabilistic  gating  that  precedes  the  association  algorithm.  As  demonstrated  by 
the  results  in  the  tables,  the  use  of  the  attribute  data  for  data  association  increases  the 
degrees  of  discrimination  available  to  the  algorithm,  resulting  in  an  increase  from  around 
85%  correct  association  to  99%  for  the  SSR  data  for  this  scenario.  The  gains  are  similar 
for  the  association  of  the  ES  data;  the  percentage  of  correct  associations  rises  from  below 
60%  for  kinematic  only  association  to  just  above  70%  for  joint  kinematic  and  attribute 
association.  The  lower  association  accuracy  for  the  ES  data  is  an  artifact  of  the  relatively 
inaccurate  bearing  measurements  and  the  lack  of  range  discrimination  of  the  ES  sensor. 


14 


19.80'  Longitude  12°  S  19.65'  19.80'  Longitude  12°  S  19.65' 


DSTO-RR-0315 


Figure  6:  Track  location  with  constrained  association 
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Table  2:  Identity  estimation  performance  for  kinematic  and  joint  data  association 


Track 

Threshold 

First  exceeds 

Exceeds 

Kinematic 

Joint 

Kinematic 

Joint 

0.70 

1 

1 

92% 

92% 

0.75 

5 

5 

87% 

87% 

0.80 

7 

7 

85% 

85% 

1 

0.85 

7 

7 

85% 

82% 

0.90 

7 

9 

70% 

80% 

0.95 

9 

9 

53% 

80% 

0.70 

1 

1 

97% 

97% 

0.75 

3 

3 

95% 

95% 

0.80 

3 

3 

95% 

95% 

2 

0.85 

3 

3 

95% 

92% 

0.90 

5 

5 

87% 

90% 

0.95 

5 

5 

68% 

87% 

The  majority  of  incorrect  associations  occur  when  the  targets  are  in  close  proximity  and 
following  a  track  swap  event. 

Although  this  example  demonstrates  increased  data  association  accuracy  for  the  joint 
kinematic  and  attribute  association  over  the  kinematic  only  association,  the  magnitude 
of  this  increase  is  scenario  dependent.  For  example,  Khosla  &  Chen  [2003]  considered 
the  inclusion  of  feature  information  in  Dempster-Shafer  and  Probabilistic  Argumentation 
System  (PAS)  association  algorithms.  They  compared  the  kinematic  and  kinematic  plus 
feature  association  using  a  more  difficult  three  target  scenario  where  the  targets  were  on 
the  limits  of  the  sensor  resolution  for  a  significant  period,  and  achieved  a  probability  of 
77%  correct  association  for  kinematic  only  association,  and  98%  correct  association  when 
using  both  kinematic  and  feature  information.  The  greater  difficulty  inherent  in  their  sce¬ 
nario  is  manifest  in  the  lower  performance  for  kinematic  only  association,  but  the  similar 
performance  when  feature  information  is  introduced  indicates  that  greater  benefit  may 
be  realised  from  the  more  difficult  scenarios.  That  is,  little  benefit  is  expected  for  well 
separated  targets,  where  the  simplest  of  association  algorithms  will  provide  adequate  per¬ 
formance,  but  the  attribute  data  may  be  the  difference  between  adequate  and  inadequate 
performance  in  dense  target  scenarios  where  the  association  gates  of  the  tracks  overlap. 

The  effect  of  kinematic  only  and  joint  kinematic  and  attribute  data  association  on 
the  identification  probabilities  obtained  from  multiple  Monte  Carlo  runs  are  presented  in 
Figures  7  and  8,  respectively.  Table  2  contains  a  comparative  summary  of  the  performance 
of  the  algorithm  in  terms  of  the  number  of  the  scan  at  which  the  identification  probability 
first  exceeds  a  particular  threshold  and  the  total  percentage  of  scans  for  which  the  threshold 
is  exceeded,  averaged  over  all  the  runs.  The  results  are  presented  for  a  range  of  probability 
thresholds  from  0.7  to  0.95. 

It  is  evident  that  the  initial  performance  is  almost  identical  for  both  association  al¬ 
gorithms,  as  indicated  by  the  figures  and  the  First  exceeds  columns  of  Table  2.  This  is 
expected,  as  the  two  targets  are  well  separated  and  easily  discriminated  during  this  period. 
However,  the  identification  probabilities  for  the  kinematic  only  association  falls  below  that 


16 


DSTO-RR-0315 


0.8 

0.7 

Q-  - 

:  9; 

0  • 

g 

Q.  0.5 

'•  Q 

o 

9  QA 

.  □ 

0.3 

/  o 

.  □ 

O  G  O  O  O 


©;;■;©  ;;Q:. 


■  S  "  O' 


4.02  6.02  8.02  10.02  12.02  14.02  16.02  18.02  20.02  22.02  24.02  26.02  28.02  30.02  32.02  34.02  36.02  38.02  40.02  42.02  44.02 

Time  (s) 


□ . & 


.  P 

0  ®-'° 


(a)  Track  1 

□  □  □  ■  ■  □  ■  □  D  □  E  Q 


B  . 


©  o 


38.02  40.02  42.02  44.02 


Time  (s) 


(b)  Track  2 

Figure  7;  Identification  probabilities  using  kinematic  only  association 
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Figure  8:  Identification  probabilities  using  joint  kinematic  and  attribute  association 
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of  the  joint  for  the  higher  thresholds,  as  shown  in  the  Exceeds  columns  of  Table  2.  This 
is  supported  by  the  falling  probability  for  the  correct  identities  in  Figure  7,  caused  by  the 
higher  number  of  incorrect  associations  for  the  kinematic  only  association  that  result  from 
the  tracks  swapping  targets. 


5  State  Update 

State  estimation,  or  state  update,  problems  may  also  be  modelled  by  BBNs.  The 
state  or  states  of  interest  are  represented  by  the  hypothesis  nodes  of  the  BBN,  and  the 
measurements  by  the  evidence  nodes.  The  BBN  grows  over  time:  in  the  case  of  static  or 
non- evolutionary  state  variables,  evidence  nodes  are  added  as  new  measurements  become 
available;  and  for  dynamic  or  evolutionary  state  variables,  evidence  nodes  and  hypoth¬ 
esis  nodes  (representing  the  new  or  updated  states  of  the  variables  at  the  time  of  the 
measurement)  are  added  when  new  measurements  become  available. 

Solutions  are  developed  and  demonstrated  using  a  simulated  target  tracking  and  recog¬ 
nition  application  for  both  evolutionary  and  non-evolutionary  state  variable  estimation. 


5.1  Evolutionary  State  Update 

The  evolutionary  states,  that  is,  the  states  that  change  value  over  time,  include  both 
continuous  and  discrete  state  variables.  In  tracking  applications,  the  most  commonly 
occurring  continuous  evolutionary  state  variables  are  the  kinematic  states  of  a  target,  that 
is,  position,  velocity  and,  possibly,  acceleration.  Discrete  evolutionary  states  may  include 
variables  such  as  the  target  flight  level  and  weapon  status. 

The  evolutionary  state  variables  are  handled  by  treating  the  time  varying  variable  as  a 
sequence  of  time  invariant  variables  at  specific  times  of  interest,  such  as  the  measurement 
times.  By  way  of  an  example,  consider  a  time  varying  state  variable  X,  and  a  set  of 
measurements  up  to  and  including  time  t,  that  is,  Zl  =  {Z i, . . . ,  Zt}.  Initially  the  BBN  will 
consist  solely  of  the  node  Xq,  which  represents  the  state  of  the  variable  T  at  a  time  instant 
prior  to  the  arrival  of  any  measurements.  At  the  time  of  each  measurement,  two  new  nodes 
are  added  to  the  BBN,  one  representing  the  state  of  the  variable  at  that  measurement 
time  and  the  other  the  measurement.  This  gives  rise  to  the  recursive  structure  of  the  BBN 
illustrated  in  Figure  9.  The  conditional  probability  p{xt\xt~i)  describes  the  evolution 
of  the  state  of  the  variable  X  from  time  t  —  1  to  t,  and  p(zt\xt),  the  likelihood  of  the 
measurement  Zt  =  Zt  given  that  the  value  of  X  is  x. 

The  addition  of  new  nodes  will  cause  the  network  to  grow  over  time.  However,  the 
objective  is  to  estimate  the  state  of  X  at  some  time  t,  based  on  the  evidence  or  measure¬ 
ments  up  to  and  including  that  time.  As  such,  it  is  not  necessary  to  update  the  belief 
of  the  state  of  the  nodes  at  times  prior  to  t.  Therefore  the  size  of  the  network  may  be 
contained  by  making  the  node  Xt-i  a  root  node  with  prior  probability  p  (x4_i|Zi_1),  and 
removing  all  other  nodes  at  times  up  to  and  including  t  —  1. 
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Figure  9:  A  BBN  for  updating  evolutionary  state  variables 


Consider  the  BBN  of  Figure  9,  and  let  BeV(xt)  be  the  posterior  probability,  or  belief, 
that  Xt  =  Xf,  given  all  the  measurements  up  to  and  including  the  time  t' .  The  belief  that 
Xq  =  xq  at  time  0  is  the  prior  probability  of  x,  that  is, 

Belo(xo)  =  p  (x0)  ■  (27) 

Then,  at  each  recursion,  the  belief  at  the  new  state  node  becomes 

Beli(xt)  =  p  (xtlZ1)  =  atXt{xt)TTt{xt),  (28) 

where  at  is  a  normalising  constant  at  time  t.  Given  A (zt)  =  5Zt~'t  and  using  (2),  the 
retrospective  support  may  be  written  as 

\t(xt)  =  j  p  (z\xt)  A (zt)dz  =  p  (z't \xt)  .  (29) 


The  predictive  support,  TTt(xt),  represents  the  prior  knowledge  of  the  state  at  the  time  t 
given  the  prior  knowledge  of  the  state  at  time  0  and  all  the  evidence  or  measurements  up 
to  the  time  t.  Using  (5),  (6)  and  (1),  the  predictive  support  becomes 
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VTt(xt) 


Xt-1 


p(xt\xt-i)  TTXt(xt-l)dXt-l 

(30) 

p{xt\xt-i)  AZi_1(®t-i)vr(a;t_i)dxt_i 

(31) 

p  (xt\xt-i)Belt-i(xt-i)  dxt- 1. 

(32) 

Xt-1 


Strictly  speaking,  Belf_i(xt_i)  =  at-iXzt-i(xt-i)^(xt-i) ,  but  the  normalising  constant 
at- 1  is  absorbed  by  the  new  normalising  constant  at  and  is  omitted  here.  The  belief  that 
Xt  =  xt  then  becomes 


Belt(xt)  =  at p  (z't \xt) 


Xt-1 


p  (xt\xt-i)  Belt_i(xt_i)  dxt-i 


(33) 


or,  in  terms  of  the  posterior  probabilities, 


P  {xt\Zt) 


Xt-1 


p  (xt\xt-i)  p  (xt-i\Zt  x)  dxt-i. 


(34) 


The  normalising  constant  at  may  be  dropped  because  the  estimate  of  xt  is  found  by 
maximising  (33),  that  is, 

xt  =  arg  nrax(Beh(a:i)).  (35) 

Xt 


The  tracking  problem  is  one  of  obtaining  the  best  state  estimate  at  the  current  time. 
Therefore,  it  is  not  necessary  to  update  the  beliefs  of  the  state  variable  at  times  other 
than  t.  The  solution  for  the  special  case  of  continuous  variables,  for  example,  kinematic 
states,  for  linear  and  Gaussian  systems  may  be  obtained  by  applying  the  Kalman  filter  to 
the  measurements.  This  result  may  also  be  applied  to  the  estimation  of  discrete  variables 
by  replacing  the  integrals  with  summations.  Track  smoothing  algorithms,  which  form 
estimates  of  the  state  variables  using  both  the  past  and  future  measurements,  may  be 
developed  by  calculating  Bel^(xt)  for  all  t  <  t'  from  the  BBN  in  Figure  9. 


5.2  Non-Evolutionary  State  Update 

A  non-evolutionary  variable  is  represented  by  a  single  node  in  a  BBN,  as  its  value 
does  not  change  over  time.  The  BBN  for  this  class  of  problem  is  also  dynamic,  as  new 
measurement  nodes  are  added  to  the  network  at  each  measurement  time.  The  target 
recognition  problem,  that  is,  target  type  and  identity  (ID)  estimation,  is  used  here  to 
illustrate  the  development  of  a  solution  to  the  problem  of  estimating  non-evolutionary 
state  variables. 
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Figure  10:  A  non- evolutionary  BBN  for  target  identification  and  classification 


measurements  Sn,  n  =  1  ,  ...,1V,  and  Electronic  Surveillance  (ES)  measurements  Em, 
m  =  1, . . . ,  M.  Although  the  SSR  and  ES  measurements  are  not  required  to  occur  simul¬ 
taneously,  the  dependency  between  class  and  identity  dictates  that  both  are  updated  on 
receipt  of  either  an  ES  or  SSR  measurement.  The  choice  of  whether  the  class  is  dependent 
on  the  identity,  or  vice  versa,  is  arbitrary.  In  this  example,  it  is  assumed  that  the  number 
of  each  type  of  platform,  that  is,  the  probability  of  each,  is  determined  by  the  relevant 
nation  or  force,  hence  the  dependency  of  class  on  identity. 

The  special  case  of  discrete  variables  is  presented  here  as  both  class  and  identity  take 
discrete  values,  and  only  the  discrete  target  identity  and  class  measurements  are  considered 
from  the  SSR  and  ES  sensors,  respectively;  the  kinematic  measurements  are  ignored  for 
the  purpose  of  this  treatment.  The  generalisation  to  the  continuous  domain  is  achieved 
by  replacing  the  summations  with  integrals  in  the  following. 

For  the  example  of  Figure  10,  denote  s'n  and  e!m  as  the  measured  values  from  the 
SSR  and  ES,  respectively.  Then  using  (1)  through  (6)  and  it (i)  =  p  (i),  the  belief  in  the 
proposition  I  =  i  is 


Bel(z)  =  aX(i)n(i) 


N 


a\c{i)  IMpW 


(36) 


n=  1 
N 


M 


n= 1 


c 


m= 1 
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Similarly,  the  posterior  probability  that  C  =  c  is  written 

M  N 

Bel(c)  =  a  p  (em|c)  ^  p  (c\i)  p  (i)  p  (sn|i) .  (37) 

m=  1  i  n= 1 

Prior  to  any  measurements,  (36)  and  (37)  reduce  to  the  prior  probabilities  of  I  and  C, 
namely 


Bel(i)  =  p  (i) ,  and 
Bel(c)  =  5^p(c|*)p(»)  • 


(38) 

(39) 


As  the  measurements  are  added  to  the  BBN,  (36)  and  (37)  are  recalculated  to  give  the 
latest  posterior  probabilities. 

The  solution  is  not  recursive  in  its  current  form.  However,  the  following  outlines  a 
partially  recursive  solution.  Defining 


p  (S’0!*)  =  1,  and 
p(-E°|c)  =  1, 

the  recursive  conditional  measurement  probabilities  are  defined  as 

p  (<Sn|z)  =  p  (Sri~1|i)  p  (sn|i) ,  and 
p(Em\c)=p(Em-1\c)p(em\c), 


(40) 


(41) 

(42) 


for  all  the  values  of  i  and  c.  Based  on  these  definitions,  the  beliefs  (36)  and  (37)  become 


Bel(i)  =  ap(i)p  (Sn\i)  p  (c|i)  p  (Em\c) ,  and  (43) 

C 

Bel(c)  =  op  (Em\c)  p  (c|i)  p  ( i )  p  (5'n|i) .  (44) 

i 

The  solution  is  then  found  by  recursively  updating  the  conditional  measurement  proba¬ 
bilities,  which  are  then  used  to  recalculate  the  identity  and  class  beliefs. 

The  BBN  in  Figure  10  may  be  extended  to  include  other  sources  of  identity  and  class 
measurements,  such  as  target  dynamics  and  non-cooperative  target  recognition.  Addi¬ 
tional  state  variables,  such  as  category  (air,  surface,  subsurface,  land,  and  space),  may 
also  be  added  to  the  BBN  model. 


5.3  Simulation  Results 

The  crossing  target  scenario  of  Section  4.3  is  used  here  to  demonstrate  the  state  esti¬ 
mation  algorithms.  Again  SSR  and  ES  sensor  measurements  are  simulated.  The  simulated 
SSR  measurement  errors  have  a  standard  deviation  of  0.5°  and  20  m  for  bearing  and  range, 
respectively.  The  SSR  also  returns  an  identity  from  the  set  {1,  2, 3},  with  a  probability  of 
0.7  that  the  correct  identity  of  1  is  received  from  Target  1,  and  a  probability  of  0.7  that  the 
correct  identity  of  2  is  received  from  Target  2.  The  standard  deviation  of  the  errors  from 
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Figure  11:  SSR  measurements  and  track  positions  for  the  crossing  target  scenario 


the  ES  sensor  is  2°,  and  the  sensor  returns  a  platform  class  from  the  set  {1,2,  3, 4}  with 
a  probability  of  0.7  that  the  correct  class  of  1  is  received  from  Target  1,  and  a  probability 
of  0.7  that  the  correct  class  of  3  is  received  from  Target  2.  The  remaining  values  for  the 
identity  and  class  measurements  occur  equi-likely.  A  single  measurement  is  generated  for 
each  target  from  both  sensors  simultaneously  at  one  second  intervals. 

As  data  association  is  a  necessary  precursor  to  state  estimation,  the  simulated  mea¬ 
surements  are  input  to  the  constrained  data  association  algorithm  of  Section  4,  which  uses 
both  kinematic  and  attribute  data  to  associate  the  measurements  to  the  tracks.  (The 
performance  of  the  data  association  algorithm  was  analysed  in  Section  4.3  and  is  not  di¬ 
rectly  analysed  here.  However  it  is  acknowledged  that  it  will  influence  the  performance 
of  the  state  estimation  algorithms.)  The  associated  measurements  are  then  processed  by 
the  state  estimation  algorithms,  that  is,  the  Kalman  filter  (Section  5.1)  to  update  the 
kinematic  states,  and  the  non-evolutionary  state  estimation  algorithm  of  Section  5.2  to 
update  the  identity  and  class  of  the  targets.  The  kinematic  state  estimation  and  attribute 
(target  class)  state  estimation  algorithms  are  processed  independently,  with  each  estimat¬ 
ing  different  target  state  variables  using  different  sets  of  measurements.  Both  tracking  and 
data  association  are  performed  in  polar  coordinates. 

The  track  positions  from  a  single  simulation  run  are  presented  in  Cartesian  coordi¬ 
nates  in  Figure  11,  where  the  solid  lines  represent  the  tracks,  the  dotted  lines  the  true 
target  trajectories,  and  the  dots  the  SSR  measurements.  In  almost  all  the  runs,  the  tracks 
correctly  follow  the  crossing  targets.  The  ability  to  accurately  estimate  the  identity  and 
class  of  the  targets  improves  the  data  association  performance,  and  hence  the  kinematic 
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Figure  12:  Identity  probabilities  for  the  crossing  target  scenario 


0.8 

0.7 

0.6 

5  05 
0.4 

0.3 

0.2 


0  5  10  15  20  25  30 

Time  (s) 


(a)  Track  1  (b)  Track  2 

Figure  13:  Class  probabilities  for  the  crossing  target  scenario 
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tracking  performance.  The  figures  12(a)  and  12(b)  show  the  estimated  probabilities  for 
each  of  the  possible  identities  over  the  first  30  seconds  for  tracks  1  and  2,  respectively.  The 
blue,  red  and  green  lines  represent  the  probabilities  for  identities  1,  2  and  3,  respectively. 
Within  a  few  measurement  updates,  the  probability  of  identity  1  for  Track  1  correctly 
approaches  unity.  Likewise,  the  probability  of  identity  2  for  track  2  also  approaches  unity 
within  the  receipt  of  a  few  measurements.  The  presence  of  subsequent  conflicting,  or  incor¬ 
rect,  identity  measurements  reduce  these  probabilities,  particularly  early  in  the  scenario 
where  fewer  measurements  contribute  to  the  identity  estimation.  This  effect  is  evident  in 
Figure  12(a),  where  an  erroneous  identity  3  measurement  at  three  seconds  increases  the 
probability  of  identity  3.  The  lessening  effect  of  conflicting  measurements  as  more  correct 
measurements  are  received  is  illustrated  by  the  near  constant  identity  probabilities  after 
ten  seconds. 

The  Figures  13(a)  and  13(b)  present  the  probabilities  for  each  of  the  possible  classes 
over  the  first  30  seconds  for  both  tracks  1  and  2,  with  the  blue,  red,  green  and  cyan 
lines  representing  the  probabilities  for  classes  1,  2,  3  and  4,  respectively.  Within  several 
measurement  scans,  the  probability  of  Class  1  correctly  approaches  unity  for  Track  1, 
and  the  probability  of  Class  3  approaches  unity  for  Track  2.  As  for  the  identity,  the 
effect  of  erroneous  measurements  shortly  after  track  initiation  is  again  illustrated,  where 
an  incorrect  class  4  measurement  at  five  seconds  increases  the  probability  of  Class  4  and 
decreases  the  probability  of  Class  1  for  Track  1,  as  shown  in  Figure  13(a).  In  addition, 
because  of  the  dependency  between  class  and  identity,  incorrect  identity  measurements 
influence  the  class  probabilities  and  incorrect  class  measurements  influence  the  identity 
probabilities. 


6  Joint  Tracking  and  Classification 

Consider  an  example  tracking  and  recognition  application  where  the  class  and  kine¬ 
matic  states  of  an  object  are  estimated  from  inaccurate  class  and  kinematic  measurements. 
This  problem  may  be  represented  by  the  BBN  in  Figure  14,  where  the  variable  C  repre¬ 
sents  the  (time  invariant)  target  class,  X k  the  kinematic  state  at  time  k ,  Z k  the  target 
kinematic  measurement  at  time  k ,  and  Sk  the  class  measurement  at  time  k.  The  BBN 
grows  over  time,  with  the  addition  of  new  X ,  Z  and  S  nodes  at  each  time  step.  This 
growth  at  each  time  step  provides  a  framework  for  developing  a  recursive  algorithm. 

The  BBN  of  Figure  14  contains  loops  in  the  underlying  Markov  network;  therefore  it 
requires  an  iterative  algorithm  to  obtain  a  solution,  to  which  it  may  or  may  not  converge. 
As  for  previous  cases,  this  problem  is  overcome  by  aggregating  interconnected  groups  of 
variables  into  composite  variables  or  clusters  [Pearl  1988,  Krieg  2001].  An  equivalent 
clustered  BBN  is  presented  in  Figure  15,  where 


Y0  =  {X0,C}, 

Yk  =  {Xk,Xk^,C}  k>  1, 

Ak  =  {Sk,C}  k>  1, 

Bk  =  {ZklXk}  k  >  1. 


(45) 
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Figure  If:  BBN  representation  of  a  joint  tracking  and  classification  problem 


The  conditional  probabilities  associated  with  each  link  between  the  composite  nodes 
are  the  joint  probability  of  all  the  variables  in  the  child  composite  node  conditioned  on  all 
the  variables  common  to  both  nodes  connected  by  the  link,  that  is, 

v(Vk\yk-i)  =  p(xk,xk-i,c\xk-i}c)  =-p(xk\xk-i,c) , 
p(afe| Vk)  =  p(sfc,c|c)  =  p(sfc|c) ,  (46) 

P  (bk\yk)  =  p  (zk,  xk\xk )  =  p  (zk \xk) . 

The  clustered  BBN  is  capable  of  representing  states  containing  various  kinematic  and 
attribute  data,  where  the  attributes  may  include  any  of  target  identity,  category  and 
class.  In  general,  the  composite  variable  Ak  represents  all  measurements  relating  to  time 
invariant  parameters,  and  Bk  those  relating  to  time  varying  parameters.  The  variables  of 
interest  are  obtained  by  marginalising  the  composite  state  variables. 

Based  on  this  generality,  the  approach  here  is  to  initially  solve  for  the  composite  state 
variables  Yk,  k  =  0, 1,  2, ... .  As  the  belief  is  a  posterior  probability,  the  belief  that  the 
state  variable  Yk  =  yk,  given  all  measurements  up  to  time  k,  is  denoted  p  (yk\Ak,  Bk'j . 


26 


DSTO-RR-0315 


Figure  15:  Equivalent  clustered  BBN  for  joint  tracking  and  classification 


At  the  time  0,  the  BBN  contains  the  single  variable  Yq,  with  prior  probability  p  (yo)  = 
p  [yo\A°,  B°),  where  A 0  and  B°  are  the  (empty)  sets  of  measurements  up  to  time  0.  Using 
Figure  14  and  equation  (45),  the  prior  probability  of  Yq  =  y0  is 

p(l/o)  =  p(so|c)p(c)  ■  (47) 


Now,  consider  an  arbitrary  time  k,  where  the  BBN  terminates  with  nodes  Yk,  A k  and  B k- 
From  (1)  and  (3),  the  belief,  p  (jjk\Ak ,  Bk) ,  is  the  normalised  product  of  ir {yk),  A Ak(lJk ) 
and  A Bk{Vk)-  Assuming  that  A/,  only  contains  instantiated  leaf  or  measurement  nodes, 
and  nodes  common  to  both  itself  and  Yk,  the  value  of  Ak  is  denoted  a'k,  and  A (a^)  =  5 aka'ki 
which  equals  1  if  =  a'k  and  zero  otherwise.  Therefore,  using  (4), 


A Ak(yk)  =  /  P  (ak\yk)  \{ak)dak 


ak 


=  P  {a!k\yk)  . 


(48) 


Similarly, 

Aefc(yfe)  =p  {b'k\yk)  ■  (49) 


Assuming  TTYk(yk-i)  is  known,  (5)  gives 


ir{yk) 


P  (yk\yk-i)  TTYk(yk-i)dyk-i, 


Uk-l 


(50) 
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and 


(yk\Ak,Bk'j  =  Belk(yk)  =  akp  (ak\yk)  p  (bk\yk)  j  p  (yk\yk-i)  ^Yk{yk-i) dyk-i-  (51) 


Uk- 1 


Adding  another  variable  Yk+i  to  the  BBN,  that  is,  the  state  at  time  k  +  1,  (6)  provides 

^Yk+1(yk)  =  p(4lyfc)p(6fcM  /  p(yfc|yfc-i)7Tyfc(yfc-i)<%-i 


Uk-1 


(52) 


=  an 


;XP 

Therefore,  TrYk(yk- 1)  =  a^p  (yk-i  \Ak~L,  Bk~l) ,  and  (51)  becomes 

c,£A:)  =  afcp  (a'k\yk)  p  (bk\yk)  ak\  J  p  (yk\yk-i)  P  (j/fc-il^*-1,  dj/fc-i, 

(53) 


Vk-l 


where 


-l 


y/fc 


(4M  P  KM  ak-i  /  p(yfc|yfc-i)p  (WiM  X)  dyk_xdyk 


Vk-l 


(54) 

Substituting  the  definitions  of  (45)  and  the  probabilities  of  (46),  and  absorbing  the 
constant  aj^  into  ak,  the  posterior  probability  (53)  becomes,  in  terms  of  the  original 
variables, 


p  (xk,  Xfe_i,  c\Zk,  SkJ  =  ak p  (4|c)  p  (zk\xk)  p  (xk\xk-i,  c) 

x  J  p  (xk-1,xk-2,c\Zk~1,Sk~1')  dxk_2 

Xk-2 

=  «AP  (4IC)  P  (4l*fc)p(s*ls*-i>c)p  (sfc-i,c|Zfc-1,5fe-1 

Integrating  over  Xfc-i  gives 

p  (xfc,  c|Zfc,  =  afcp  (4|c)  p  (4|xfc) 

x  /  p(xfc|xfc_i,c)p  dxfc_i 


(55) 


*^fe  — 1 


=  afcp  (4|c)  P  1,5fc 

x  y  p(xfe|xfe_i,c)p  (xfc_i|c,  dxfc_i 


(56) 


®fc-l 


From  Bayes  Rule, 


pKIM  /  p(xfc|xfc_i,c)p  (xfc_i|c,Zfc  1,S'fc  ^ 


•^fc  — 1 


(57) 


=  p(xk\c,  Zk,Sk~1)  P(k\ 
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where 


Pk]  =  J  P  (Zk\xk)  j  p(xfe|xfe-i,c)p  ^Xfc-i|c,  Zk  1,Sk  l^j  dxk-idxk-  (58) 

xk  xk  —  1 

Similarly, 

p(4|c)p(c|Zfc-1,5fc-1) 

=  p(c|Zfc-1,5fc)7*> 

where 

Ik  =  ^P  (4|c)  p  [c\Zk-1 ,  Sk~l^j  . 

c 

Substituting  (57)  and  (59)  into  (56)  gives 

P  (xk,  c\Zk ,  Sk^j  =  7fcp  ^c|Zfc_1,  Sfc)  /3^c)p  (xk\ c,  Zk,  Sk^  ,  (61) 


(59) 

(60) 


where 


p  (xk\c,Zk,Sk  1  j  =  Plc)  Xp  (zk\xk)  j  p(xfc|xfc_i,c)p  (xk-i\c,  Zk  1,Sk  ^  dxk-i, 

xk  —  1 

(62) 

p(c|Zfc-\,S*)  =7-1p(4|c)  p  (cIZ^-1,^-1)  ,  (63) 

and  and  7 k  are  defined  in  (58)  and  (60),  respectively. 

The  class  given  all  the  class  and  kinematic  measurements  is  obtained  by  integrating 
(61)  over  all  values  of  xk,  that  is, 

p  (c\Zk,  Sk )  =  7fcP  (c|Zfc_1,  4C)  j  p  (sfc|c,  Zk,  Sk -1)  dxk 

xk  (64) 

=  7fcP  (c\Zk~\Sk)^\ 

Similarly,  the  kinematic  state  given  all  the  kinematic  and  class  measurements  is  now 
obtained  by  summing  (61)  over  all  values  of  c,  that  is, 


p  ( xk\Zk ,  Sk)  =  7fc  E  P  (clz*_1> sk )  $ P  ( *k\c ,  Zk ,  5fc”1) 

C 

=  YJv(c\Zk,Sk)v(xk\c,Zk,Sk -1)  . 

c 


(65) 


Therefore  the  probabilities  p  (xk\c,  Zk,  Sk  1)  and  p  [c\Zk  1,Sk )  may  be  recursively 
estimated  at  each  time  step  using  equations  (62)  and  (63),  and  the  prior  probabilities 

p  (xo|c,  Z°,  S° )  =  p  (x0|c) ,  and  (66) 

p(c|Z°,5°)  =p(c).  (67) 
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(Z°  and  S°  denote  the  empty  kinematic  and  attribute  measurement  sets  at  time  0.)  From 
these,  the  posterior  probabilities  p  (c\Zk,  Sk )  and  p  (xk,  \Zk,  Sk )  may  be  calculated  at  each 
time  step  using  (64)  and  (65).  These  posterior  probabilities  are  then  used  to  obtain  the 
most  likely  values  for  these  variables. 

This  approach  differs  from  that  of  Challa  &;  Pulford  [1999],  which  incorporates  a  time 
variant  model  for  the  target  class  that  uses  a  transition  matrix  to  define  the  probability 
that  the  target  class  will  change.  Although  the  target  class  does  not  change,  in  practice 
it  may  prove  useful  in  allowing  more  rapid  correction  of  incorrect  classification  decisions, 
or  may  assist  in  correcting  tracking  errors,  such  as  track  swaps. 


6.1  Linear  Gaussian  Kinematic  Systems 


Linear  Gaussian  systems  comprise  a  linear  target  motion  model  with  white  Gaussian 
additive  noise,  which,  for  the  time  interval  k  —  1  to  k,  may  be  defined  as 


Xk  —  Fk—lXk— 1  T  Wk— i, 


(68) 


where  Fk_ i  represents  the  state  transition  matrix  and  wk-\  is  the  zero  mean  process 
noise  with  covariance  Qk-i •  The  conditional  probability  p  (xk\xk-i,c)  is  distributed  as 

Af  (xk  '■  Q^k-i)  >  an(l  the  prior  probability  p  (xo)  as  Af  (xq  :  Xq  \  FqC^  ,  where 

Af  (•)  denotes  the  normal  or  Gaussian  distribution,  and  denotes  the  class  to  which  the 
model  applies.  Similarly,  the  linear  Gaussian  kinematic  measurement  model  with  white 
Gaussian  additive  noise  for  class  c  at  time  k  is  given  by 

zk  =  H(kc)  x[c)  +  v[c) ,  (69) 

(c) 

where  H k  is  the  measurement  matrix  that  maps  the  state  space  onto  the  measurement 

(c)  .  ...(c) 

space,  vk  ;  is  the  zero  mean  measurement  noise  with  covariance  Rj.  ,  and  the  conditional 
probability  p  (zk \xk)  is  distributed  as  Af  (zk  :  H^x^,  .  The  attribute  measurement 

model  is  a  probability  mass  function  representing  the  discrete  measurement  probabilities, 
given  the  state. 

Substituting  the  Gaussian  probabilities  into  (62)  and  (57),  and  rearranging,  gives 
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(c)  (c) 

where  x\  and  P\ .  are  the  mean  and  covariance  of  the  state  estimate  for  model  c  at  time 
i  given  all  the  measurements  up  to  and  including  time  j,  and 


/t’=  / 


(c) 

A 


N 


AA 


.  Z?(c)  r(c) 

•  ^  k— 


0(c)  ^  A7  (Vc)  •  x(c)  P(c)  ^  dx[c)  dx[c)  (71) 

^k~l)  \Xk-l  ■  Xk-l\k-Vrk-l\k-l)  aJLk-laXk  y  > 


(c) 

cfc-l 


r(c)  p(c) 


(c)T 


+  . 


The  solution  to  (70)  may  be  obtained  using  a  Kalman  filter  for  each  class,  which  may 
then  be  used  in  (65)  and  (64)  to  obtain  the  solutions  for  p  (x*,| Zk,Sk)  and  p  (c\Zk,Sk), 
respectively.  In  particular,  p  [xk\Zk ,  Sk)  may  be  calculated  using  a  multiple  model  Kalman 
filter,  where  the  model  probabilities  are  p  (c| Zk,  Sk )  =  y^p  (c|Zfc_1,  Sk )  j3^ . 


6.2  Simulation  Results 

A  scenario  with  a  single  target  that  performs  a  2  g  and  a  0.5  g  turn  is  used  to  evaluate 
the  joint  tracking  and  classification  algorithm.  The  target  may  take  one  of  two  possible 
classes,  namely  Cl,  which  is  a  non-manoeuvring  platform,  and  C2,  which  is  capable  of 
performing  significant  manoeuvres.  The  sensor  SI  provides  bearing  and  range  measure¬ 
ments  and  is  representative  of  a  primary  radar,  and  sensor  S2  provides  bearing  and  class 
measurements  and  is  representative  of  an  ES  sensor.  The  class  measurements  of  S2  are 
correct  80%  of  the  time.  Sensor  SI  provides  measurements  at  1  Hz,  and  S2  provides 
measurements  at  0.2  Hz.  The  non-linear  measurement  function,  which  maps  the  polar 
coordinate  sensor  measurement  space  onto  the  Cartesian  state  space,  is  linearised  using 
an  EKF. 

The  estimated  track  position  (solid  line)  and  measurements  (dots)  from  SI  are  shown 
in  Figure  16,  with  the  estimates  from  the  constant  velocity  and  manoeuvring  models  in 
Figures  17(a)  and  17(b),  respectively.  These  figures  show  that,  after  the  first  manoeuvre 
when  the  probability  of  class  C2  is  almost  unity,  the  kinematic  state  estimates  from  the 
manoeuvring  model  dominate  the  positional  estimates.  This  is  expected,  as  the  class 
probabilities  provide  the  summation  weights  for  the  multiple  model  kinematic  tracker. 

Figure  18  shows  the  estimated  probability  that  the  target  is  class  C2  over  time.  The 
probability  increases  with  the  number  of  class  measurements  over  the  first  20  s  of  the 
scenario,  albeit  with  the  occasional  decrease  in  the  probability  caused  by  the  20%  of  class 
measurements  that  are  incorrect.  The  first  manoeuvre,  which  occurs  after  20  s,  reinforces 
class  C2,  significantly  increasing  its  probability.  The  incorrect  measurements  now  have 
less  impact,  particularly  after  the  second  manoeuvre  at  40  s,  which  also  supports  the  C2 
class. 

To  further  understand  the  interaction  between  the  kinematic  and  attribute  measure¬ 
ments  and  states,  the  scenario  is  repeated  with  the  class  measurements  supporting  class 
Cl,  which  is  in  direct  conflict  with  the  target  dynamics  employed  in  the  scenario.  This 
situation  is  artificial  and  should  only  occur  in  practice  if  the  measurements  are  incorrectly 
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Figure  1 6:  Track  position  from  position  and  class  measurements 


X-Position  (km)  X-Position  (km) 


(a)  Constant  velocity  model 


(b)  Manoeuvring  model 


Figure  17:  Individual  model  track  position  from  position  and  class  measurements 
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associated  to  the  track;  the  use  of  attribute  data  in  the  data  association  will  reduce  the 
likelihood  of  this  occurring.  In  fact,  the  simulations  show  that  the  joint  kinematic  and 
attribute  association  algorithm  of  Section  4.2  [Krieg  2002]  forces  the  track  to  break  at  the 
first  manoeuvre  and  a  new  track  based  only  on  the  radar  measurements  is  initiated,  thus 
overcoming  the  conflict  condition.  To  maintain  a  continuous  track,  kinematic  only  data 
association  is  used  in  this  case. 

As  shown  in  Figure  19,  the  probability  of  class  C2  now  falls  during  the  period  prior  to 
the  manoeuvre,  as  expected  with  class  measurements  supporting  a  class  of  Cl.  However,  as 
the  target  manoeuvres,  the  probability  of  class  C2  rises  rapidly  toward  unity.  In  this  case, 
the  evidence  provided  by  the  manoeuvre  is  more  significant  than  the  class  measurements, 
which  have  only  an  80%  confidence  of  being  correct. 

Finally,  the  effect  of  the  kinematic  state  estimates  on  the  class  probabilities  is  demon¬ 
strated.  Three  targets  are  employed  in  this  scenario,  one  constant  velocity,  one  performing 
a  moderate  manoeuvre,  and  one  performing  a  more  demanding  manoeuvre.  Only  the  range 
and  bearing  measurements  of  SI  are  available,  that  is,  no  class  measurements  are  available. 
Each  target  is  one  of  three  classes,  labelled  1  through  3,  where  each  is  able  to  perform 
manoeuvres  of  increasing  acceleration. 

The  estimated  position  for  each  target  is  shown  in  Figure  20,  with  20(a)  being  the 
composite  track  from  all  three  class  models,  and  20(b)  through  20(d)  being  the  individual 
tracks  for  each  class  model.  The  filter  lag  during  the  manoeuvres  on  the  blue  track 
representing  the  target  with  the  highest  acceleration  manoeuvre  capability  is  evident  for 
class  models  1  and  2,  and  the  same  effect  is  also  evident,  albeit  to  a  lesser  extent,  on  the 
red  track  for  class  model  1.  The  composite  tracks  are  dominated  by  the  constant  velocity 
class  model  prior  to  the  manoeuvres,  and  by  the  appropriate  class  model  during  and  after 
the  manoeuvres.  This  behaviour  is  consistent  with  the  class  estimation,  as  illustrated 
in  Figure  21,  where  21(a)  through  21(c)  show  the  probabilities  for  classes  1  through  3, 
respectively.  The  colours  of  the  lines  in  these  plots  correspond  to  the  colours  of  the  tracks 
in  Figure  20.  Prior  to  the  manoeuvres,  the  probability  of  class  1  increases,  as  no  evidence 
for  the  class  of  each  track  is  available.  However,  the  manoeuvre  provides  class  information, 
with  the  probabilities  for  the  correct  classes  rapidly  rising  towards  unity  in  each  case.  This 
approach  favours  the  model  that  specifies  the  manoeuvres  with  the  lowest  acceleration, 
until  a  target  manoeuvre  with  higher  acceleration  eliminates  those  class  models  specifying 
manoeuvres  with  lower  acceleration.  This  is  consistent  with  Bayesian  approaches  [Ristic 
&  Smets  2005]. 


7  Conclusions 


Bayesian  belief  networks  (BBNs)  are  an  effective  method  for  modelling  the  uncertain 
dependencies  between  continuous  and  discrete  variables.  As  such,  they  provide  a  suit¬ 
able  framework  for  developing  solutions  for  the  target  tracking  and  recognition  problem. 
Algorithms  for  kinematic  and  attribute  data  association  and  state  estimation  have  been 
developed  using  this  framework  and  the  advantages  of  using  both  kinematic  and  attribute 
information  has  been  demonstrated  using  simulated  scenarios. 
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(a)  Single  run  (b)  Mean 

Figure  18:  Probability  of  class  C2  from  position  and  class  measurements 


(a)  Single  run  (b)  Mean 

Figure  19:  Probability  of  class  C2  from  conflicting  position  and  class  measurements 
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X-Position  (km)  X-Position  (km) 


(a)  Composite  (b)  Model  1 


X-Position  (km)  X-Position  (km) 

(c)  Model  2  (d)  Model  3 

20:  Track  positions  for  the  three  target  scenario  with  only  position  measurements 
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Time  (s)  Time  (s) 

(a)  Class  1  (b)  Class  2 


Time  (s) 

(c)  Class  3 

Figure  21:  Class  probabilities  for  the  three  target  scenario  with  only  position  measure¬ 
ments 
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The  use  of  both  kinematic  and  attribute  sensor  data  and  target  states  for  data  associa¬ 
tion  improves  the  ability  of  the  algorithms  to  discriminate  between  multiple  measurements 
and  targets.  The  resulting  improvement  in  data  association  performance,  particularly  in 
dense  target  scenarios,  has  been  shown  through  demonstration  to  improve  kinematic  track¬ 
ing  performance.  The  application  of  attribute  state  estimation  algorithms  that  provide 
accurate  estimates  of  a  target’s  attribute  states  enhances  the  data  association  performance 
and  hence  the  kinematic  tracking  performance. 

A  joint  kinematic  and  attribute  tracking  algorithm  has  been  developed  using  a  BBN 
that  captures  the  dependencies  between  the  target’s  kinematic  and  attribute  states.  It  has 
been  shown  that  the  target’s  class  or  type  can  be  estimated  from  the  kinematic  behaviour 
of  the  target  using  this  technique.  In  addition,  knowledge  of  the  target’s  class  may  im¬ 
prove  the  kinematic  tracking  performance  through  the  selection  of  the  most  appropriate 
kinematic  model  for  that  class. 

Future  work  may  include  the  development  and  evaluation  of  similar  algorithms  for 
different  sensor  mixes,  target  types  and  scenarios.  In  particular,  more  complex  sets  of 
attribute  variables,  including  variables  such  as  platform  type  and  platform  specific  type, 
may  be  developed. 
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